Resolving Demonstrative Anaphora in the TRAINS93 Corpus
نویسندگان
چکیده
This paper reports the results of a first-pass annotation exercise of third-person pronouns in the TRAINS spoken dialog corpus. Our research goal is to disambiguate demonstrative pronouns that/this, and the annotation project is undertaken with the intent to find surface features that can be used to strongly prefer/disprefer certain potential referents. Pragmatic differences between demonstrative pronouns and definite pronouns are also discussed. An analysis of the annotation data uncovered many statistically significant patterns that can be used to determine when a pronoun might be referring to an entity in the global focus, such as the plan or a discourse segment, versus an item in the task domain, such as an engine. These patterns can be used to constrain the search space in a reference resolution algorithm that resolves reference to both task and global-focus entities using only easy-to-compute syntactic features.
منابع مشابه
Automation and Validation of Annotation for Hindi Anaphora Resolution
The process of labelling any language genre by which one can extract useful information is called annotation. This provides syntactic information about a word or a word phrase. In this paper, an effort has been made to provide the algorithm for semiautomatic annotation for Hindi text to cater anaphora resolution only. The study was conducted on twelve files of Ranchi Express available in EMILLE...
متن کاملAnaphora Resolution of Demonstrative Noun Phrases in Medline Abstracts
This paper reports our investigation of machine learning methods applied to anaphora resolution for Biology texts. Our primary concern is the investigation of features and their combinations for effective anaphora resolution. In this paper, we focus on the resolution of demonstrative anaphoric noun phrases. We propose several novel features that we call highlighting features and consider their ...
متن کاملLa reconnaissance automatique de la fonction des pronoms démonstratifs en langue arabe (Automatic recognition of demonstrative pronouns function in Arabic) [in French]
________________________________________________________________________________________________________ Automatic recognition of demonstrative pronouns function in Arabic Anaphora resolution is one of the most difficult tasks in NLP. Classifying pronouns before attempting a task of anaphora resolution is important because to handle the cataphoric pronoun, the system should determine the antece...
متن کاملNatural Language Processing Scientific Literature DEMONSTRATIVE ANAPHORA: FORMS AND FUNCTIONS IN FULL-TEXT SCIENTIFIC ARTICLES
This study examines the functions and characteristics of demonstrative anaphora (this, these, that, those) in a collection of full-text scientific documents, confirming that they play an important role in maintaining discourse focus and binding together cohesive sections of text. Unlike corpora in other subject domains, the Cystic Fibrosis database contains more demonstrative expressions than a...
متن کاملResolving and Generating Definite Anaphora by Modeling Hypernymy using Unlabeled Corpora
We demonstrate an original and successful approach for both resolving and generating definite anaphora. We propose and evaluate unsupervised models for extracting hypernym relations by mining cooccurrence data of definite NPs and potential antecedents in an unlabeled corpus. The algorithm outperforms a standard WordNet-based approach to resolving and generating definite anaphora. It also substa...
متن کامل